智能论文笔记

Score-based Generative Models for Calorimeter Shower Simulation

Vinicius Mikuni , Benjamin Nachman

分类：机器学习

2022-06-17

基于分数的生成模型是一类新的生成算法，即使在高维空间中也可以产生逼真的图像，目前超过其他基准类别和应用程序的其他最新模型。在这项工作中，我们介绍了Caloscore，这是一种基于分数的生成模型，用于对量热计淋浴的应用。使用快速热量量表模拟挑战2022数据集研究了三个不同的扩散模型。Caloscore是基于分数的生成模型在对撞机物理学中的第一个应用，并且能够为所有数据集生成高保真量热计图像，为热量计淋浴模拟提供了替代范式。

translated by 谷歌翻译

Online-compatible Unsupervised Non-resonant Anomaly Detection

Vinicius Mikuni , Benjamin Nachman , David Shih

分类：机器学习

2021-11-11

对异常检测方法的需求不断增长，可以以模型 - 不可知的方式扩大对新颗粒的搜索。大多数新方法的建议专注于信号灵敏度。但是，选择异常事件是不够的 - 还必须有一个策略来为所选事件提供上下文。我们提出了无监督检测的第一个完整的策略，其包括信号灵敏度和用于背景估计的数据驱动方法。我们的技术由两个同时培训的autoencoders建造，被迫彼此去相关。该方法可以脱机用于非共振异常检测，也是第一个完整的在线兼容的异常检测策略。我们表明，我们的方法在为ADC2021数据挑战准备的各种信号上实现了出色的性能。

translated by 谷歌翻译

A comprehensive analysis of the Elo rating algorithm: Stochastic model, convergence characteristics, design guidelines, and experimental results

Daniel Gomes de Pinho Zanco , Leszek Szczecinski , Eduardo Vinicius Kuhn , Rui Seara

分类：机器学习 | 人工智能

2022-12-22

The Elo algorithm, due to its simplicity, is widely used for rating in sports competitions as well as in other applications where the rating/ranking is a useful tool for predicting future results. However, despite its widespread use, a detailed understanding of the convergence properties of the Elo algorithm is still lacking. Aiming to fill this gap, this paper presents a comprehensive (stochastic) analysis of the Elo algorithm, considering round-robin (one-on-one) competitions. Specifically, analytical expressions are derived characterizing the behavior/evolution of the skills and of important performance metrics. Then, taking into account the relationship between the behavior of the algorithm and the step-size value, which is a hyperparameter that can be controlled, some design guidelines as well as discussions about the performance of the algorithm are provided. To illustrate the applicability of the theoretical findings, experimental results are shown, corroborating the very good match between analytical predictions and those obtained from the algorithm using real-world data (from the Italian SuperLega, Volleyball League).

translated by 谷歌翻译

Toward Human-AI Co-creation to Accelerate Material Discovery

Dmitry Zubarev , Carlos Raoni Mendes , Emilio Vital Brazil , Renato Cerqueira , Kristin Schmidt , Vinicius Segura , Juliana Jansen Ferreira , Dan Sanders

分类：机器学习 | 人工智能

2022-11-05

There is an increasing need in our society to achieve faster advances in Science to tackle urgent problems, such as climate changes, environmental hazards, sustainable energy systems, pandemics, among others. In certain domains like chemistry, scientific discovery carries the extra burden of assessing risks of the proposed novel solutions before moving to the experimental stage. Despite several recent advances in Machine Learning and AI to address some of these challenges, there is still a gap in technologies to support end-to-end discovery applications, integrating the myriad of available technologies into a coherent, orchestrated, yet flexible discovery process. Such applications need to handle complex knowledge management at scale, enabling knowledge consumption and production in a timely and efficient way for subject matter experts (SMEs). Furthermore, the discovery of novel functional materials strongly relies on the development of exploration strategies in the chemical space. For instance, generative models have gained attention within the scientific community due to their ability to generate enormous volumes of novel molecules across material domains. These models exhibit extreme creativity that often translates in low viability of the generated candidates. In this work, we propose a workbench framework that aims at enabling the human-AI co-creation to reduce the time until the first discovery and the opportunity costs involved. This framework relies on a knowledge base with domain and process knowledge, and user-interaction components to acquire knowledge and advise the SMEs. Currently,the framework supports four main activities: generative modeling, dataset triage, molecule adjudication, and risk assessment.

translated by 谷歌翻译

Learning to Rank Graph-based Application Objects on Heterogeneous Memories

Diego Moura , Vinicius Petrucci , Daniel Mosse

分类：机器学习

2022-11-04

Persistent Memory (PMEM), also known as Non-Volatile Memory (NVM), can deliver higher density and lower cost per bit when compared with DRAM. Its main drawback is that it is typically slower than DRAM. On the other hand, DRAM has scalability problems due to its cost and energy consumption. Soon, PMEM will likely coexist with DRAM in computer systems but the biggest challenge is to know which data to allocate on each type of memory. This paper describes a methodology for identifying and characterizing application objects that have the most influence on the application's performance using Intel Optane DC Persistent Memory. In the first part of our work, we built a tool that automates the profiling and analysis of application objects. In the second part, we build a machine learning model to predict the most critical object within large-scale graph-based applications. Our results show that using isolated features does not bring the same benefit compared to using a carefully chosen set of features. By performing data placement using our predictive model, we can reduce the execution time degradation by 12\% (average) and 30\% (max) when compared to the baseline's approach based on LLC misses indicator.

translated by 谷歌翻译

A Robust Scientific Machine Learning for Optimization: A Novel Robustness Theorem

Luana P. Queiroz , Carine M. Rebello , Erber A. Costa , Vinicius V. Santana , Alirio E. Rodrigues , Ana M. Ribeiro , Idelfonso B. R. Nogueira

分类：机器学习

2022-09-13

科学机器学习（SCIML）是对几个不同应用领域的兴趣越来越多的领域。在优化上下文中，基于SCIML的工具使得能够开发更有效的优化方法。但是，必须谨慎评估和执行实施优化的SCIML工具。这项工作提出了稳健性测试的推论，该测试通过表明其结果尊重通用近似值定理，从而确保了基于多物理的基于SCIML的优化的鲁棒性。该测试应用于一种新方法的框架，该方法在一系列基准测试中进行了评估，以说明其一致性。此外，将提出的方法论结果与可行优化的可行区域进行了比较，这需要更高的计算工作。因此，这项工作为保证在多目标优化中应用SCIML工具的稳健性测试提供了比存在的替代方案要低的计算努力。

translated by 谷歌翻译

Multiresolution Neural Networks for Imaging

Hallison Paz , Tiago Novello , Vinicius Silva , Luiz Schirmer , Guilherme Schardong , Luiz Velho

分类：计算机视觉 | 机器学习

2022-08-25

我们介绍MR-NET，这是一种用于多分辨率神经网络的一般体系结构，也是基于此体系结构进行成像应用的框架。我们的基于坐标的网络在空间和规模上都是连续的，因为它们由多个阶段组成，这些阶段逐渐增加了更细节。除此之外，它们是一个紧凑而有效的表示。我们展示了多分辨率图像表示以及用于纹理放大和缩小以及抗脉化的应用。

translated by 谷歌翻译

An Evolutionary Approach for Creating of Diverse Classifier Ensembles

Alvaro R. Ferreira Jr , Fabio A. Faria , Gustavo Carneiro , Vinicius V. de Melo

分类：计算机视觉

2022-08-23

分类是数据挖掘和机器学习领域中研究最多的任务之一，并且已经提出了文献中的许多作品来解决分类问题，以解决多个知识领域，例如医学，生物学，安全性和遥感。由于没有单个分类器可以为各种应用程序取得最佳结果，因此，一个很好的选择是采用分类器融合策略。分类器融合方法成功的关键点是属于合奏的分类器之间多样性和准确性的结合。借助文献中可用的大量分类模型，一个挑战是选择最终分类系统的最合适的分类器，从而产生了分类器选择策略的需求。我们通过基于一个称为CIF-E（分类器，初始化，健身函数和进化算法）的四步协议的分类器选择和融合的框架来解决这一点。我们按照提出的CIF-E协议实施和评估24种各种集合方法，并能够找到最准确的方法。在文献中最佳方法和许多其他基线中，还进行了比较分析。该实验表明，基于单变量分布算法（UMDA）的拟议进化方法可以超越许多著名的UCI数据集中最新的文献方法。

translated by 谷歌翻译

Differential Geometry for Neural Implicit Models

Tiago Novello , Guilherme Schardong , Luiz Schirmer , Vinicius da Silva , Helio Lopes , Luiz Velho

分类：机器学习

2022-01-23

我们引入了一个神经隐式框架，该框架利用神经网络的可区分特性和点采样表面的离散几何形状，以将它们作为神经隐含函数的级别集近似。为了训练神经隐式函数，我们提出了近似签名距离函数的损失功能，并允许具有高阶导数的术语，例如曲率的主要方向之间的对齐方式，以了解更多几何细节。在训练过程中，我们考虑了基于点采样表面的曲率的不均匀采样策略，以优先考虑点更多的几何细节。与以前的方法相比，这种抽样意味着在保持几何准确性的同时更快地学习。我们还介绍了神经表面（例如正常矢量和曲率）的分析差异几何公式。

translated by 谷歌翻译

Combining Learning from Human Feedback and Knowledge Engineering to Solve Hierarchical Tasks in Minecraft

Vinicius G. Goecks , Nicholas Waytowich , David Watkins , Bharat Prakash

分类：机器学习 | 人工智能

2021-12-07

利益的现实世界任务通常由人类可读描述定义不足，并且没有预定义的奖励信号，除非它由人类设计师定义。相反，数据驱动的算法通常旨在解决特定的，狭义定义的任务，具有驱动代理学习的性能度量。在这项工作中，我们提出了赢得第一名的解决方案，并获得了2021个神经潮端竞赛Minerl Basalt挑战的最人性化的代理：从Minecraft中的人力反馈中学习，该参与者使用人类数据来解决仅限定义的四个任务通过自然语言描述，没有奖励功能。我们的方法使用可用的人类演示数据来培训仿制学习策略，以便导航和额外的人机反馈来训练图像分类器。然后将这些模块与估计的内径型图一起组合到基于人类的人类知识设计的状态机，该任务在自然等级中断和控制学习代理应该在任何瞬间遵循的宏观行为的控制中。我们将这种混合智能方法与端到端机器学习和纯工程解决方案进行比较，然后由人类评估符判断。 CodeBase可在https://github.com/viniciusguigo/kairos_minerl_basalt上获得。

translated by 谷歌翻译